(57) Abstract

A method for efficiently screening full-length cDNA clones, comprising: determining the base sequence of the 5'-region of each
clone in a cDNA library prepared by a method that yields a high proportion of full-length clones; examining that 5'-region for
the presence and position of an initiation ATG, using software developed in-house for predicting initiation codons in cDNA, so
that the presence and position of the initiation codon are judged accurately; and selecting from the cDNA library the clones
judged to carry the initiation codon. Moreover, a cDNA library containing full-length clones at an extremely high proportion can
be constructed by pooling the clones thus selected.
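
As a rough illustration of the first part of this screening step, the sketch below lists every candidate ATG in a 5'-end read using 1-based positions. It is only a minimal sketch: the function name `find_atg_positions` is hypothetical, and it does not reproduce the prediction software mentioned in the abstract, which also assigns a score to each ATG. The input is the first 80 bases of SEQ ID NO: 16 from the sequence listing in this document.

```python
def find_atg_positions(seq: str) -> list[int]:
    """Return the 1-based position of every ATG triplet in seq (any frame)."""
    # Sequence listings group bases in blocks of ten, so drop the spaces first.
    s = seq.upper().replace(" ", "")
    return [i + 1 for i in range(len(s) - 2) if s[i:i + 3] == "ATG"]


# First 80 bases of SEQ ID NO: 16, copied from the sequence listing.
five_prime_read = (
    "ATCATTTCCT TATTTATATT TCATGTTGGA ATGCTTAAAT "
    "CGATAACCTT TGTATTTTGA AGTGCGCGAC ATGGAAGGTG"
)

print(find_atg_positions(five_prime_read))  # [23, 31, 71]
```

The three positions agree with the first three ATG positions tabulated in this document for clone F-ZRV6C1001410, which suggests that the tables count positions from 1 at the 5' end of the read.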
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($&£MtL~C\&, I"a. G. Pedersen, Proceedings of fifth international c 
onference on intelligent systems for molecular biology, p226-233 (1997), 
held in Halkidiki, Greece, June 21-26, 1997. j tfi^f » ^3rV>£ 
MJ-r^rD^^A(CO^Ttt^^tLT§T{±l>a^ ( TQene Finderj V. V. 
Solovyev et-al., Nucleic Acids Res., 22, 5156-5163 (1994). > TGrailj Y. 
Xu et al., Genet-Eng-N-Y. , 16, 241-253 (1994)) , £CD7n 

?lts $t±&m<DM^cdM^ J 77 U -{1^i£T*f1^ LfccDNA^ ^ 77 U-(C 
^£ft3^D->CD5'MM©i&M?U£&;£U ClOF^^tfcV^ b£#iJfflLT 

a^Wtzti, £1% *U=f*-^ * 7&-C±:ScDNA7>f 77 U— £{15S£U cDNA 
7^77 U— {C^^n^^<o^cD^ p — >(zo^T^©5 , ^WiJ©^iE?U$: 

^^(Z*^0^#^(i N tg(MlfeV7 r#HiRII*&=3 r*>©#£©*iJlft*5 
ct^^ttfl^^fiitC^-r^Clil^T-^s u©V7 r£«k&»3R|§*&=i 
#STS^*iJ^$tlfe^D->^cDNA7>f 77 U— A*t>31^-r*l« N 
^:^©cDNA^D->$:MtRr§^^* s nI^T*feS^i:^B^^bfeo * 
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£ftfcfc©-e&£ s (4) {cia«©^^ 



BNSDOCID: <WO 9920750A1 I > 



WO 99/20750 




PCT/JP98/04772 



(7) (4) (3|3^©^(Cj;D^LaScDNA7^^U-, 

D->£?g£rr3 c ht: J; D^±fi*ft^F &fcS5«>fccDNA7>f 'J— £f«?- 

NA^D — >£3IiR-f3#&&, (a) cDNA^ 4 ^=7 U — (c^^ ti^cDNA^ p-> 
©5'5fcffifr&Jfi£ia?y£ifc£T3X«, (b) ifc££*ifcJtt*E?!l#fc:BI*&=i K 
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— 38r^T6-=481-(-1993)., 603-607 (1996).. 

M. Maruyama and S. Sugano, Gene, 138, 171-174 (1994). k V y T'SS 

hOkayama-Berg&^^^-y-T^^LfcS&^^-'J ^r^(S. Kato et a 

1., Gene, 150, 243-250 (1994)., MM • ^¥6-153953 19944f6J33Bk 
CAPldDNA'J % V >£-fb3N££r&(N. Merenkova and D. M. 

Edwards, W0 96/34981 Nov. 7, 1996.), CAPCDtr^M^fBtC^S** y7ft¥ 
^^(P. Carninci et al., Genomics, 37,327-336 (1996)., P. Carninci et 
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nZm^Zfim (I.Edery et al. ,MCB, 15, 3363-337K 1995 ) ) s Cap Switch olig 
onucleotide&T'ifc&Cap FinderClck tft 

cDNA^>f r7U-iP&©cDNA^D->0**H: x (J.Sambrook,E.F.Frits 
ch & T.Maniatis,Molecular Cloning Second edition,Cold Spring Harbor Labo 
ratory Press 1989) fc^fciBftOtt&fc* t)fr-5£fc#T£*o 

^LTfcfc, *«»H#&CJ:t)IB**nfcaiBr*ftMl«lt:|B«CD 
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70W±T^D-><&M^1-^h^u->©^ SW7 7t ^37*0.80W±T^ 
P - > 5h^n ~ >©^±S*tt81Jk X =1 7* £0. 90&±t- * d - > £ £ 
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JIttWfctt, A) »BLfcATG©sr«©-3tttH«* (ftl0<-ttl00tt£) ©* 

ATODSOU&ftftS?) »*tl«6*»A*©i|*fl|«u ©ATG©^ 
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IZXK) y^T^ 5. KDNAfcttmbfco IftWCIi^ t h fl£Sg:fc «fc Vt: htg#«Blfla (t 
eratocarcinoma NT-2 N neuroblastoma SK-N-MC) ck X> ~%W, (J.Sambrook.E.F.Fri 
tsch & T.Maniatis, Molecular Cloning Second edition, Cold Spring Harbor La 
boratory Press 1989) mm&fimz £ t)mRNA£}ftm Ltz<> XVdHr^y 

2) (ilh2©^) s Sfcii7>^A7W-77^7- m&m^: 

3) (g3i:I4©l^) &ffll>T:» (^-fg, Mi? H^, 41, 
603-607 (1996). p606. Y.Suzuki et al. , Gene, 200, 149-156(1997)) (Df5,Mlzifc\,\ 
WWM, TAPM-H. RNA^ ri-^-^s m~mc\)M(D&m£Mk<Dm3k*'i : f-3 tzo 
tf^T's PCRtCcfc i?2^cDNAtc^^L. SfiH2J»rLfco tf^-£% Dral IIT'ii0»T t 
fc^?*-pME18SCG. pMFL^ (@K • ^LlU, I6S Mlt MM, 38, 472-481 (1 
993).p480) lCcDNA©#ftt££&#>T^D-- >^Lfco CI HJC £ 19 f#£DNA(zo 
UTDNA-> — ^>i/>^lii:^(DyeTerminator Cycle Sequencing FS Ready Reacti 
on Kit, PE Applied Biosystems)£ffl^T^ — a-T)]/lz Lf:ibt>>- 
^Si&f, DNAv-^>i^-(ABIPRISM 377, PE AppliedBiosystems)T-DNAi&Sia 
?'i£8?#Tbfco o — >CD5' 3fc<£ t> n ^*L^ni[a]©<frDNAIE£'J£»rL&o 

— >©DNAia^J(C-o^Ts lfii#gLfccDNA©®JlR§{3#i:3 K>Mi|:7*n^A 
(ATGpr) tr^iRIH^n K >##£t-£frgfrtc:o£#p#r bfeo £©J§?#t7*o^ 

— mm^^Mx^t^ 

t&T— X-m*n©^ o — > (F-NT2RP 1000020. F-NT2RP 1000025, F-NT2R 
P1000039, F-NT2RP1000046) {cov>TCDMW^m*J^TtC^-r (^ 1 ) o F-NT2R 
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P1000020(880 bp)(i, 88-690tt# r Huma n neuron-specific gamma-2 enolasej 
(GenBank accession No.M22349)fctttffi[£Uf$#965L F-NT2RP1000025(645 bp)fi N 
29-641&# THuman alpha-tubulin mRNAj (GenBank accession No.K00558) izM 
L®m&m7%, F-NT2RP1000039(820 bp)&, 12-820f£# ^man mRNA for elon 
gation factor 1 alpha subunit(EF-l alpha)j (GenBank accession No. X0355 
8)fc#LfflRltt#g6Jk F-NT2RP1000046(788 bp)H:3-788<ft# r Huma n M2-type py 
ruvate kinase mRNAj (GenBank accession No.M23725)(c*f Uffi|5Jt4^97%T-fe^,„ 
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F-NT2RP1000122) fc:^T©#ffi*gm£Ja"F£^-r (3t2) 0 F-NT2RP1000013(6 
08 bp)&l-606jiE# r Human nuclear matrix protein 55 (nmt55) mRNAj (GenBan 
k accession No. U89867){c*t LfSIHR£#97% N F-NT2RP1000054(869 bp)(±l-869fe 
ifi r Human signal recognition particle (SRP54) mRNAj (GenBank accession N 
o. U51920)t^Lffi[5H47b s 96%. F-NT2RP1000122(813 bp)ttl-813(4# rn.sapien 
s mRNA for 2-5A binding proteinj (GenBank accession No.X76388)(z^tb1ifSl 
ATG No.  ATG position  ATGpr score  ATG position  ATGpr score
1        85            <0.94>       120           <0.94>
2        208           0.26         187           0.54
3        386           0.05         312           0.06
4        518           0.09         388           [illegible]
5        545           0.05         445           0.05



^3^^-T<fcotF-ZRV6C1000408 N F-ZRV6C 1000454, F-ZRV6C1000466, F-ZRV6 
C1000615, F-ZRV6C1000670I;:oim:&, *n^*l85&, 107& N 162£u 85ft, 12 
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Ote©i&S r Aj frt>!&$L2> r ATGj b* tmfeZ ft tz 0 ZCDtzfrZ 

ft (±> Z u — > Ji^ftcDNA ^ p - > T* & & h ¥«J »r L tz 0 

£fcs B?#T£fTo£^D-->©5-£ N S?tRH3#;3 vyftft&Lte^t^mzft 
tz&MW-y (F-ZRV6C1001410, F-ZRV6C1001197. F-ZRV6C1001472) (30V>T 
cDg?*f^m*WT(3^1- (^4) o Zftt>?v-><DV3iii%6m<Di&mmm 
ZZft^ftmvm^ : 1 6s 17, 18£^-To 



ATG No.  F-ZRV6C1001410             F-ZRV6C1001197             F-ZRV6C1001472
         ATG position  ATGpr score  ATG position  ATGpr score  ATG position  ATGpr score
1        23            0.05         5             0.24         77            0.25
2        31            0.07         141           0.25         126           0.05
3        71            0.06         202           0.05         149           0.05
4        178           0.05         219           0.05         194           0.05
5        214           0.05         228           0.05         213           0.22
6        -             -            -             -            249           0.05
7        -             -            -             -            338           0.09
8        -             -            -             -            344           0.05
9        -             -            -             -            351           0.05
10       -             -            -             -            365           0.05
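
One way to read a table like this is to take the highest ATGpr score found in each clone and compare it with a cutoff. The sketch below does that for the three clones tabulated above. It is an illustration only: the helper names `best_atg` and `select_clones` and the cutoff value 0.5 are assumptions made for the example, not values taken from this document.

```python
# (ATG position, ATGpr score) pairs as read from the table above.
TABLE = {
    "F-ZRV6C1001410": [(23, 0.05), (31, 0.07), (71, 0.06), (178, 0.05), (214, 0.05)],
    "F-ZRV6C1001197": [(5, 0.24), (141, 0.25), (202, 0.05), (219, 0.05), (228, 0.05)],
    "F-ZRV6C1001472": [(77, 0.25), (126, 0.05), (149, 0.05), (194, 0.05), (213, 0.22),
                       (249, 0.05), (338, 0.09), (344, 0.05), (351, 0.05), (365, 0.05)],
}


def best_atg(scores):
    """Return the (position, score) pair with the highest score (first one on ties)."""
    return max(scores, key=lambda pair: pair[1])


def select_clones(table, cutoff):
    """Return the clones whose best ATG scores at or above the cutoff."""
    return [name for name, scores in table.items() if best_atg(scores)[1] >= cutoff]


HYPOTHETICAL_CUTOFF = 0.5  # assumed for illustration only

for name, scores in TABLE.items():
    print(name, best_atg(scores))
print(select_clones(TABLE, HYPOTHETICAL_CUTOFF))
```

With this assumed cutoff none of the three clones is selected, since their best scores are 0.07, 0.25 and 0.25.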



^4yb^s-Tcfcot3F-ZRV6C1001410, F-ZRV6C1001197. F-ZRV6C1001472fc:-D^T 
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1 . ^££cDNA*P->£jS*RT*#arefcoT\ 

(b) (a) T?fc££nfcJttM*J*fc:Htt&=i K >#^£ftT^S 18 

(c) (b) -egBJS^ K>* s ^*nTv%si:*j^$nfc^D— >saiRr^>xg N 

2 . cDNA^ -f 7*^ 'J -#S5£ga|RCDSG^cDNA-7 ^ U «fc D flMK£ 

3 . cDNA v -f ^ U— 4%BNACDCAP*Ufll||-rftXSfe$tr^jSt:J: Dfls»£ftfc 
4. ^ScDNA^-f 7^ 'J-^HtSAiStfeoT, 

(a) cDNA-^^f 7*7 'J — C^SftacDNA* D — >CD5' *i8*»e>JgMie?!l£&^-r 

(b) (a) T-^^ti^JSSIB^J^tc^P h'^^^tlT^S^S^^s HI 

(c) (b) -r*iift&=i K>^stiTi^*i:*ij^$nfc^D— >%a^-rsxjBx 

(d) ( G-)--£jgj ft$ frl fc j^-4^il^r-5-ia 

5 . cDNA-7 >f ^ U -*s^±6^©ffiV>cDNA7 -flsSi&C £ 

6 . cDNA-7-r 7^ U-tfBRNACDCAP^^fti-rSXgS^tf^iSti DflUBteilfc 
7. »}R«4t:IB«©*ast:A •D^LaScDNA^'T^ U-o 




SEQUENCE LISTING 

<110> Helix Research Institute, Inc. 
tti^^tt^ ij y xffiZzpft 

<120> The method for selecting full length cDNA clones 

<130> H1-806PCT 

<150> JP 09-289982 
<151> 1997-10-22 

<160> 18 

<170> PatentIn version 2.0

<210> 1 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligo-capping linker sequence 






<400> 1 

AGCAUCGAGU CGGCCUUGUU GGCCUACUGG 30 

<210> 2 
<211> 42 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligo(dT) adapter primer sequence 
<400> 2 

GCGGCTGAAG ACGGCCTATG TGGCCTTTTT TTTTTTTTTT TT 42 

<210> 3 
<211> 32 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Random adapter primer sequence 
<400> 3 

GCGGCTGAAG ACGGCCTATG TGGCCNNNNN NC 32 
<210> 4

<211> 880
<212> DNA

<213> Homo sapiens

<400> 4

ATGCGCCCGC GCGGCCCTAT AGGCGCCTCC TCCGCCCGCC GCCCGGGAGC CGCAGnrnrr 60
GCCGCCACTG CCACTCCCGC TCTCTCAGCG CCGCCGTCGC CACCGCCACC GPPAPTRPPA 120
CTACCACCGT CTGAGTCTGC AGTCCCGAGA TCCCAGCCAT CATGTCCATA GAGAARATPT 180
GGGCCCGGGA GATCCTGGAC TCCCGCGGGA ACCCCACAGT GGAGGTGGAT CTPTATAPTH 240
CCAAAGGTCC TTTCCGGGCT GCAGTGCCCA GTGGAGCCTC TACGGGCATC TATGAfifiPPP 300
TGGAGCTGAG GGATGGAGAC AAACAGCGTT ACTTAGGCAA AGGTGTPPTfi AAuuLAuluu 360
ACCACATCAA CTCCACCATC GCGCCAGCCC TCATCAGCTC AGGTCTCTCT GTGGTGGAGC 420
AAGAGAAACT GGACAACCTG ATGCTGGAGT TGGATGGGAC TGAGAACAAA TCCAAGTTTG 480
GGGCCAATCC ATCCTGGGTG TGTCTCTGGC CGTGTGTAAG GCANGGGCAA CTGAACNGGA 540
ACTGCCCCTG TATCGCCACA TTGCTCAGCT TGGNCGGGAA CTCANACCTC ATCCTGCCTG 600
TTGCCGGCCT TCAACGTGAT CAATGGTTGG CTTCTCATGC CTGGCAACAA ANCTGGCCAT 660
TGCNGGAATT TTCATGATCC TCCCCNTTGG GAAACTGAAA AACTTTCCGG AATGCCCNTC 720
CAACTAAGTT GCAAAAGGTC TACCNATACC CCCCAAGGGG AATTCCTCCA AGGGAACAAA 780
TNCCCGGGAA AGGAATGCCC CCCAATTNTT NGGGGGAATA AAAGGTGGGC TTTGCCCCCC 840
CATTTTCCTG GAAAAAACNA TNAAAACCCT TGGGAAACTT 880



<210> 5 
<211> 645 
<212> DNA

<213> Homo sapiens 
<400> 5 

TGTGCGTTAC TTACCTCNAC TCTTAGCTTG TCGGGGACGG TAACCGGGAC CCGGTGTCTG 60 

CTCCTGTCGC CTTCGCCTCC TAATCCCTAG CCACTATGCG TGAGTGCATC TCCATCCACG 120 

TTGGCCAGGC TGGTGTCCAN ATTGGCAATG CCTGCTGGGA GCTCTACTGC CTGGAACACG 180 

GCATCCAGCC CGATGGCCAG ATGCCAAGTG ACAAGACCAT TGGGGGAGGA GATGACTCCT 240 

TCAACACCTT CTTCAGTGAG ACGGGCGCTG GCAANCACGT GCCCCGGGCT GTGTTTGTAG 300 

ACTTGGAACC CACAGTCATT GATGAAGTTC GCACTGGCAC CTACCGCCAG CTCTTCCACC 360 

CTGAGCAGCT CATCNCAGGC AAGGAAGATG CTGCCAATAA CTATGCCCGA GGGCACTACA 420 

CCATTGGCAA GGAGATCATT GACCTTGTGT TGGACCGAAT TCGCAAGCTG GCTGACCANT 480 

GCACCGGTCT TCANGGCTTC TTGGTTTTCC ACAGCTTTGG TGGGGGAACT GGTTCTGGGT 540 

TCACCTCCCT GCTCATGGAA CGTCTCTCAG TTGATTATGG CAAGAAATCC AAGCTGGAGT 600 

TCTCCATTTA CCCAGCACCC CNGGTTTCCN CNGCTGTANT TNGAA 645 

<210> 6 

<211> 820 

<212> DNA 

<213> Homo sapiens 

<400> 6 

CTTTTTTCGC AACGGGTTTG CCGCCAGAAC ACAGGTGTCG TGAAAACTAC CCCTAAAAGC 60 

CAAAATGGGA AAGGAAAAGA CTCATATCAA CATTGTCGTC ATTGGACACG TAGATTCGGG 120 

CAAGTCCACC ACTACTGGCC ATCTGATCTA TAAATGCGGT GGCATCGACA AAAGAACCAT 180 

TGAAAAATTT GAGAAGGAGG CTGCTGAGAT GGGAAAGGGC TCCTTCAAGT ATGCCTGGGT 240

CTTGGATAAA CTGAAAGCTG AGCGTGAACG TGGTATCACC ATTGATATCT CCTTGTGGAA 300 

ATTTGAGACC AGCAAGTACT ATGTGACTAT CATTGATGCC CCAGGACACA GAGACTTTAT 360 

CAAAAACATG ATTACAGGGA CATCTCAGGC TGACTGTGCT GTCCTGATTG TTGCTGCTGG 420 

TGTTGGTGAA TTTGAAGCTG GTATCTCCAA GAATGGGCAG ACCCGAGAGC ATGCCCTTCT 480 

GGCTTACACA CTGGGTGTGA AACAACTAAT TGTCGGTGTT AACAAAATGG ATTCACTGAN 540 

CCACCCTACA GCCAGAAGAA ATATGANGAA ATTGTTAAGG AAGTCAGCAC TTACATTAAG 600 

AAAATTGGCT ACAACCCCGA CACAGTANCA TTTGTGCCAA TTTCTGGTTG GAATGGTGAC 660 

AACATGCTGG AACCAANTGC TAACATGCCT TGGTTCCAGG GATGGAAAAT CCCCCNTTAA 720 

GGATGGCNAT GCCATTGGAA CCCCCCTGCT TGAAGGCTCT GGANTGCATC CTANCACCAA 780 

CTCCTTCAAA TTGAAAAACC CCTTGCNCCC GCCTCCNCCA 820

<210> 7 



<211> 788 
<212> DNA 
<213> Homo sapiens 



<400> 7 

GAGGCTGAGG CAGTGGCTCC TTGCACAGCA GCTGCACGCG CCCTBflCTRr GGATCTCTTC 60
GTCTTTGCAG CGTAGCCCGA GTCGGTCAGC GCCGGAGGAC CTCAGCAGCC ATGTCGAAGC 120
CCCATAGTGA AGCCGGGACT GCCTTCATTC AGACCCAGCA GCTGCACGCA GCCATGGCTG 180
ACACATTCCT GGAGCACATG TGCCGCCTGG ACATTGATTC ACCACCCATC ACAGCCCGGA 240
ACACTGGCAT CATCTGTACC ATTGGCCCAG CTTCCCGATC AGTGGAGACG TTGAAGGAGA 300
TGATTAAGTC TGGAATGAAT GTGGCTCGTC TGAACTTCTC TCATGGAACT CATGAGTACC 360
ATGCGGAGAC CATCAAGAAT GTGCGCACAG CCACGGAAAG CTTTGCTTCT GACCCCATCC 420
TCTACCGGCC CGTTGCTGTG GCTCTAGACA CTAAAGGACC TGAGATCCGA ACTGGGCTCA 480
TCAAGGGCAG CGGCACTGCA GAGGTGGAGC TGAAGAATGG AGCCACTCTC AAAATCACGC 540
TGGATAATGC CTACATGGAA AAGTGTGACG AGAACATCCT GTGGCTGGAC TACAAGAACA 600
TCTGCAAGGT GGTGGAAGTG GGCAACAAGA TCTACGTGGA TGATGGGCTN ATTTCTCTCC 660
AGGTGAACAC AAAGGTGCCG ACTTCCTGGG TGACNGANGT GGAAAATGGT GGCTCCTTGG 720
GCNCAAGAAA GGTGTGAACT TCCTGGGGCT GCTGTGGANT TGCCTGCTGT GTCNGAAAAA 780
GACATCCA 788

<210> 8

<211> 608
<212> DNA
<213> Homo sapiens

<400> 8

ACAGCCTGGC TCCTTTGAGT ATGAATATGC CATGCGCTGG AAGGCACTCA TTGAGATGGA 60
GAAGCAGCAG CAGGACCAAG TGGACCGCAA CATCNAGGAG GCTCGTGAGA AGCTGGAGAT 120
GGAGATGGAA GCTGCACGCC ATGAGCACCA GGTCATGCTA ATGAGACAGG ATTTGATGAG 180
GCGCCAAGAA GAACTTCGGA GGATGGAAGA GCTGCACAAC CAAGANGTGC AAAAACGAAA 240
GCAACTGGAG CTCAGGCAGG AGGAANAGCG CAGGCGCCGT GAAGAANAGA TGCGGCGGCA 300
GCAAGAAGAA ATGATGCGGC GACNGCAGGA AGGATTCAAG GGAACCTTCC CTGATGCGAG 360
AGAGCAGGAG ATTCGGATGG GTCNGATGGC TATGGGAGGT GCTATGGGCA TAAACNACAG 420
ATGTGCCATG CCCCCTGCTC CTGTGCCAGC TGGTACCCCA GCTCCTCCAG GACCTGCCAC 480
TATTATGCCG GATGGAACTT TGGGATTGAC CCCACCNACA ACTGAACGCT TTGGTCNGGC 540
TGCTACNATG GAANGAATTG GGGCAATTGG TGGAACTCCT CCTGCATTCN ACCGTGCAGC 600
TCCTGGGA 608

<210> 9



<211> 869 
<212> DNA 
<213> Homo sapiens 



<400> 9 

ATATTAAACT AGTGAAGCAA CTAAGAGAAA ATGTTAAGTC TGCTATTGAT CTTGAAGAGA 60 

TGGCATCTGG TCTTAACAAA AGAAAAATGA TTCAGCATGC TGTATTTAAA GAACTTGTGA 120 

AGCTTGTAGA CCCTGGAGTT AAGGCATGGA CACCCACTAA AGGAAAACAA AATGTGATTA 180 

TGTTTGTTGG ATTGCAAGGG AGTGGTAAAA CAACAACATG TTCAAAGCTA GCATATTATT 240 

ACCAGAGGAA AGGTTGGAAG ACCTGTTTAA TATGTGCAGA CACATTCAGA GCAGGGGCTT 300 

TTGACCAACT AAAACAGAAT GCTACCAAAG CAAGAATTCC ATTTTATGGA AGCTATACAG 360 

AAATGGATCC TGTCATCATT GCTTCTGAAG GAGTAGAGAA ATTTAAAAAT GAAAATTTTG 420 

AAATTATTAT TGTTGATACA AGTGGCCGCC ACAAACAAGA AGACTCTTTG TTTGAAGAAA 480 

TGCTTCAAGT TGCTAATGCT ATACAACCTG ATAACATTGT TTATGTGATG GATGCCTCCA 540 

TTGGGCAGGC TTGTGAAGCC CAGGCTAAGG CTTTTAAAGA TAAAGTAGAT GTACCTCAGT 600 

AATAGTGACA AAACTTGATG GCCATGCAAA ANGAAGTGGT GCACTCAGTG CAGTCGCTGC 660 

CACAAAAAAT CCGATTATTT TCATTGGTAC AGGGGGAACA TATANATGAC TTTGAACCTT 720

TCAAAAACAC AGCCTTTTAT TAACAAACTT CTTGGTATNG GCGACATTGA AAGGACTGAT 780 

AAATAAAGTC CACNAATTGA AATTTGGATG ACNATGNAAA CCCTTATTGA AAAAATTGAA 840 

ACATNGTCCA GTTTTACTTT GCGAAACNT 869



<210> 10 
<211> 813
<212> DNA 

<213> Homo sapiens 
<400> 10 

GTTGTGGTAT CTGTATTAAG AAATGCCCCT TTGGCGCCTT ATCAATTGTC AATCTACCAA 60 

GCAACTTGGA AAAAGAAACC ACACATCGAT ATTGTGCCAA TGCCTTCAAA CTTCACAGGT 120 

TGCCTATCCC TCGTCCAGGT GAAGTTTTGG GATTAGTTGG AACTAATGGT ATTGGAAAGT 180 

CAACTGCTTT AAAAATTTTA GCAGGAAAAC AAAAGCCAAA CCTTGGAAAG TACGATGATC 240 

CTCCTGACTG GCAGGAGATT TTGACTTATT TCCGTGGATC TGAATTACAA AATTACTTTA 300 

CAAAGATTCT AGAAGATGAC CTAAAAGCCA TCATCAAACC TCAATATGTA GACCAGATTC 360 

CTAAGGCTGC AAAGGGGACA GTGGGATCTA TTTTGGACCG AAAAGATGAA ACAAAGACAC 420 

AGGCAATTGT ATGTCAGCAG CTTGATTTAA CCCACCTAAA AGAACGAAAT GTTGAAGATC 480 

TTTCAGGAGG AGAGTTGCAG AGATTTGCTT GTGCTGTCGT TTGCATACAG AAAGCTGATA 540 

TTTTCATGTT TGATGAGCCT TCTAGTTACC TAGATGTCAA GCAGCGTTTA AAGGCTGCTA 600 

TTACTATACG ATCTCTAATA AATCCAGATA GATATATCAT TGTGGTGGAA CATGATCTAA 660 

GTGTATTAGA CTATCTCTCC GACTTCATCT GCTGTTTATA TGGTGTACCA AGCGCCTATG 720 

GAATTGTCAC TATGCCTTTT AGTGTTAGAA AAGGCATAAA CNTTTTTTGG ATGGGTATGT 780 

TCCAACAGAA AACTTGANAA TCNNAAATGC NTC 813 



<210> 11 

<211> 655 
<212> DNA 
<213> Homo sapiens 
<400> 11

AGACTCTCAC [illegible] AGftAAPGPPA P.PPPTTPAPP ppttppptpp TCCTTGGCTG 60
ACTCACCGCC PTPGPPGPPG PAPPATP.GAP rpppppurro AuliTGGTCAA CTTTGGGCCT 120
GGTCCCGCPA AGPTGPPGPA PTPAOTCTTfl TTAPAPATAP AAAAGGAATT ATTAGACTAC 180
AAAGGAGTTG GPATTARTGT TPTTftAAATP APTPAPAPPT LATCAGATTT TGCCAAGATT 240
ATTAAPAATA PAGAGAATPT [illegible] TTPPTAPPTP TTCCAGACAA CTATAAGGTG 300
ATTTTTPTGP AARGAGGTHfi nT^pnpppAp lllAvilblllj TCCCCTTAAA CCTCATTGGC 360
TTGAAAGPAG [illegible] HP.APTATPTP PTPAPAPPAP LTTGGTCAGC TAAGGCCGCA 420
GAAPtAARPPA AHAAfiTTTfin PAPTATAAAT AlCbTTCACC CTAAACTTGG GAGTTATACA 480
AAAATTCCAG ATCCAAGCAC CTGGAACCTC AACCCANATG CCTCCTACGT GTTTTATTGC 540
NCAAATGAAA CGGTGCATGG TGTTGANTTT GACTTTATAC CCNATGTCAA GGGAACANTA 600
CTGGTTTGTG ACATTTTCCT CCAACTTCCT GTCCAANCCA ATTGNATGTT TCCAA 655



<210> 12

<211> 599
<212> DNA

<213> Homo sapiens

<400> 12

AAAGATGCGC AGGCGCCGTG TGGCACTCGG CGGTCGAAAG GGGAGTTCAA GGAGACGGGG 60

GCGACGCGGC TGAGGGCTTC TCGTCGGGGT CGGGGCTGCA GCCGTCATGC CGGGGATAGT 120

GGAGCTGCCC ACTCTAGAGG AGCTGAAAGT AGATGAGGTG AAAATTAGTT CTGCTGTGCT 180

TAAAGCTGCG GCCCATCACT ATGGAGCTCA ATGTGATAAG CCCAACAAGG AATTTATGCT 240

CTGCCGCTGG GAANAGAAAG ATCCGAGGCG GTGCTTAGAG GAAGGCAAAC TGGTCAACAA 300

GTGTGCTTTG GACTTCTTTA GGCAGATAAA ACGTCACTGT GCAGAGCCTT TTACAGAATA 360

TTGGACTTGC ATTGATTATA CTGGCCAGCA GTTATTTCGT CACTGTCGCA AACAGCAGGC 420

AAAGTTTGAC NAGTGTGTGC TGGACAAACT GGGCTGGGTG CGGCCTGACC TGGGAAAACT 480 

GTCAAAGGTC ACCAAAGTGA AAACAGATCN ACCTTTACCG GANAATCCCT ATCACTCAAG 540 

AACAAGAACG GATCCCAGCC CTGANATCNA AGGAAATCTG CANCCTGCCA CACATGGCA 599 

<210> 13 

<211> 597 
<212> DNA 

<213> Homo sapiens 
<400> 13 

ATATCCGGAG TAGACGGAGC CGCAGTAGAC GGATCCGCGG CTGCACCAAA CACTGCCCCT 60 

CGGAGCCTGG TAGTGGGCCA CAAGCCCCCA GTCCCAGAGG CGTGATTTTC TGGCATCCTT 120 

AAATCTTGTG TCAAGGATTG GTTATAATAT AACCAGAAAC CATGACGGCG GCTGAGAACG 180 

TATGCTACAC GTTAATTAAC GTGCCAATGG ATTCAGAACC ACCATCTGAA ATTAGCTTAA 240 

AAAATGATCT AGAAAAAGGA GATGTAAAGT CAAAGACTGA AGCTTTGAAG AAAGTAATCA 300 

TTATGATTCT GAATGGTGAA AAACTTCCTG GACTTCTGAT GACCATCATT CGTTTTGTGC 360 

TACCTCTTCA GGATCACACT ATCAAGAAAT TACTTCTGGT ATTTTGGGAG ATTGTTCCTA 420 

AAACAACTCC AGATGGGAGA CTTTTACATG AGATGATCCT TGTATGTGAT GCATACAGAA 480

AGGATCTTCA ACATCCTAAT GAATTTATTC NAAGGATCTA CTCTTCGTTT TCTTTGCAAA 540 

TTGAAANAAA CANAATTGCT AAAACCTTTA ATGCCANCTA TNCCTGCATT TTTGGGA 597 

<210> 14 

<211> 634 
<212> DNA

<213> Homo sapiens 

<400> 14 

AGACTCTCAC CGCAGCGGCC AGGAACGCCA GCCGTTCACG CGTTCGGTCC TCCTTGGCTG 60 

ACTCACCGCC CTCGCCGCCG CACCATGGAC GCCCCCAGGC AGGTGGTCAA CTTTGGGCCT 120 

GGTCCCGCCA AGCTGCCGCA CTCAGTGTTG TTAGAGATAC AAAAGGAATT ATTAGACTAC 180 

AAAGGANTTG GCATTAGTGT TCTTGAAATG AGTCACAGGT CATCAGATTT TGCCAAGATT 240 

ATTAACAATA CAGAGAATCT TGTGCGGGAA TTGCTAGCTG TTCCAGACAA CTATAAGGTG 300 

ATTTTTCTGC AAGGAGGTGG GTGCGGCCAG TTCAGTGCTG TCCCCTTAAA CCTCATTGGC 360 

TTGAAAGCAG GAANGTGTGC GGACTATGTG GTGACAGGAG CTTGGTCAGC TAAGGCCGCA 420 

NAANAAGCCA AGAANTTTGG GACTATAAAT ATCGTTCACC CTAAACTTGG GAGTTATACA 480 

AAAATTCCAG ATCCAAGCAC CTGGAACCTC AACCCAGATG CCTCCTACGT GTATTATTGC 540 

GCNAATGAAA CNGTGCATGG TGTGGANTCT GACTTTATAC CCGATGTCNA GGGAACATAC 600 

TGGTTTGTGA CATGTCCTCA AACTTCCCGT CCNA 634 

<210> 15 

<211> 757 

<212> DNA 

<213> Homo sapiens 



<400> 15 

AGTCTGCGGT GGGCTANCGG ACGGTCCGGC TTCCGGCGGC CGTTTCTGTC TCTTGCTGGC 60 

TGTCTCGCTG AATCGCGGCC GCCTTCTCAT CGCTCCTGGA AGGTCCCGAG CGCGACACCA 120 

TGTCGGAACC CGGGGGCGGC GGCGGCGAAG ACNGCTCGGC CGGATTGGAA GTGTCGGCCG 180 
TGCANAATGT GGCGGACGTG TCGGTGCTGC ANAAGCACCT GCGCAAGCTG GTGCCGCTGC 240

TGCTGGAGGA CGGCGGCGAA GCGCCGGCCG CGCTGGAGGC GGCGCTGGAG GAGAAGAGCG 300 

CCCTGGAGCA GATGCGCAAG TTCCTTTCGG ACCCGCACGT CCACACGGTG CTGGTGGAGC 360 

GCTCCACGCT CAAAGTGGAC GTCGGTGATG AAGGAGAAGA AGAAAAAGAA TTCATTTCCT 420 

ATAACATCAA CNTAGACATT CACTATGGGG TTAAATCCAA TAGCTTGGCA TTCATTAAAC 480 

GTACTCCCGT GATTGATGCA GATAAACCCG TGTCTTCTCA NCTCCGGGTC CTTACACTCA 540 

GTGAANACTC NCCCTACNAA AACTTTGCAT TCTTTCATTA ACAATGCAGT GGCTCCTTTT 600 

TTTAANTCCT ACATTAAAAA ATCTGGCAAG GCAAACAGGG ATGGTGATAA AATGGCTCCT 660 

TCCNTTGAAA AAAAAATTGC CGAACTCNAA ATNGGACTCC TTCCCTTGCA NCAAAATTTT 720 

TGAAATTCCG GAAAATCANC CTGCCCAATT CCTCCCC 757 



<210> 16 



<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 16 

ATCATTTCCT TATTTATATT TCATGTTGGA ATGCTTAAAT CGATAACCTT TGTATTTTGA 60 

AGTGCGCGAC ATGGAAGGTG ATCTGCAAGA GCTGCATCAG TCAAACACCG GGGGATAAAT 120

CTGGATTTGG GTTCCGGCGT CAAGGTGAAG ATAATACCTA AAGAGGAACA CTGTAAAATG 180 

CCAGAAGCAG GTGAANAGCA ACCACAAGTT TAAATGAAGA CAAGCTGAAA CAACGCAAGC 240 

TGGTTTTATA TTAGATATTT GACTTAAACT ATCTCAATAA AGTTTTGCAG CTTTCACCAC 300 

<210> 17

<211> 313
<212> DNA

<213> Homo sapiens

<400> 17

AAAGATGGPR uLviuijijbAljlT lAubLAliAliC AGGACGCCGC TGCTGCCGCC GCCACCGCCG 60
LllL-ObtltL AGTCGCCTCC GGTCCTTCAA ACTCACACCT CCCGGGAGGA GCTGTCCTGG 120
CGCCGGGTCC CGCGGGGAAA ATGGTGGAGC CAGGGCAAGA TTTACTGCTT GCTGCTTTGA 180
GTGAGAGTGG AATTAGTCCG AATGACTCTT TGATATTGAT GGTGGAGATG CANGGCTTGC 240
AACTCCAATG CCTACCCCGT CAGTTCAGCA NTCAGTGCCA CTTANTGCAT TANAACTANG 300
TTTGGAGACC GAA 313

<210> 18

<211> 667
<212> DNA 
<213> Homo sapiens 

<400> 18 

ACTGCCGGGC TCGGCGTGAG TCGCTGCGGG GCTGACGGGG TGGCAGTGCG GCGGGTTACG 60

GCCTGGTCAG ACCATAATGA CTTCAGCAAA TAAAGCAATC GAATTACAAC TACAAGTGAA 120 

ACAAAATGCA GAAGAATTAC AAGACTTTAT GCGGGATTTA GAAAACTGGG AAAAAGACAT 180 

TAAACAAAAG GATATGGAAC TAAGAAGACA GAATGGTGTT CCTGAAGAGA ATTTACCTCC 240 

TATTCGAAAT GGGAATTTTA GGAAAAAGAA GAAAGGCAAA GCTAAAGAGT CTTCCCCAAA 300 

ACCANAGAGG AAAACACNAA AAACAGGATA AAATCTTATG ATTATGANGC ATGGGCAAAA 360 

CTTGATGTGG ACCGTATCCT TGATGAGCTT GACAAAGACG ATAGTACCCA TGAGTCTCTG 420 

TCTCAAGAAT CAGAGTCGGA AGAAGATGGG ATTCATGTTG ATTCNCNAAA GGCTCTTGTT 480

TTAAAAGAAA AGGGCNATAA ATACTTCCAC AAGGAAAATA TGATGAAGCA ATTGACTGCT 540

ACACNAAAGG CNTGGATGCC GATCCATATN ATCCCGTGTT GCCAACGAAC ANAACNTCCG 600

CATATTTTAG ACTGAAAAAA TTTGCTGTTG CTGAATCTGA TTGTTATTTA NCANTTGCCT 660

TGAAATA 667